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ABSTRACT 

Trichomonas vaginalis Myb3 transcription factor 
(fvMyb3) recognizes the MRE-1 promoter sequence 
and regulates ap65-1 gene, which encodes a hydro- 
genosomal malic enzyme that may play a role in the 
cytoadherence of the parasite. Here, we identified 
so as the essential fragment for DNA rec- 
ognition and report the crystal structure of 
fvMyb3 53 _ 18 o bound to MRE-1 DNA. The N-terminal 
fragment adopts the classical conformation of an 
Myb DNA-binding domain, with the third helices of 
R2 and R3 motifs intercalating in the major groove 
of DNA. The C-terminal extension forms a p-hairpin 
followed by a flexible tail, which is stabilized by 
several interactions with the R3 motif and is not 
observed in other Myb proteins. Interestingly, this 
unique C-terminal fragment does not stably connect 
with DNA in the complex structure but is involved in 
DNA binding, as demonstrated by NMR chemical 
shift perturbation, 1 H- 15 N heteronuclear-nuclear 
Overhauser effect and intermolecular paramagnetic 
relaxation enhancement. Site-directed mutagenesis 
also revealed that this C-terminal fragment is crucial 
for DNA binding, especially the residue Arg 153 and 
the fragment K 170 KRK 173 . We provide a structural 
basis for MRE-1 DNA recognition and suggest a 



possible post-translational regulation of fvMyb3 
protein. 

INTRODUCTION 

Trichomonas vaginalis is one of the most widespread 
sexually transmitted human pathogens. Infection with 
T. vaginalis causes the disease trichomoniasis and is also 
associated with several adverse health consequences, 
including increased HIV transmission and infertility 
(1,2). With the increasing number of drug-resistant 
clinical T. vaginalis strains (3,4), the infection caused by 
T. vaginalis could become a major threat to public health. 
In T. vaginalis, the iron-responsive promoter region of the 
gene ap65-l has two discrete Myb protein-recognition 
sites, MRE-1 /MRE-2r (TA ACGATA , MRE-1 in italic 
and MRE-2r underlined) and MRE-2f (TATCGT), 
interspersed among several closely spaced DNA elements 
(5,6). The ap65-l gene encodes a 65-kDa hydrogenosomal 
malic enzyme that is reputed to be a prominent tricho- 
monad adhesin and plays a role in the cytoadherence of 
T. vaginalis (7,8). Three T. vaginalis Myb transcription 
factors, rvMybl, ?vMyb2 and ?vMyb3 (Figure 1), were 
identified to regulate ap65-l (9-11). The fvMybl and 
rvMyb2 proteins, identified by Southwestern library 
screening with a probe of concatenated MRE-2f se- 
quences, conferred antagonistic actions on basal and 
iron-inducible transcription of ap65-l through dual 
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Figure 1. Sequence alignment of DBD of n>Myb3, rrMybl, ri'Myb2 and c-Myb. Secondary structure of n'Myb3 5 3_ 180 is shown on the top of the 
amino acid sequence. Amino acid numbering for each protein is indicated in the termini of sequence. 



recognition and differential promoter selection toward 
both the MRE-l/MRE-2r and MRE-2f sites (10,11). 
Recently, /vMyb3 protein was identified with use of a 
probe with concatenated MRE-l/MRE-2r sequences. The 
n>Myb3 protein was found to recognize only the MRE-1 
element to activate basal and iron-inducible ap65-l tran- 
scription. These studies demonstrate that /vMyb2 and 
A'Myb3 may coactivate basal and iron-inducible ap65-l 
transcription against fvMybl through conditional and 
competitive promoter entries (9). 

Myb proteins are characterized by a highly conserved 
DNA-binding domain (DBD), the Myb domain (12). Myb 
DBD generally consists of 1^1 imperfect repeats (R), with 
each repeat typically containing approximately 50 amino 
acids divided by three regularly spaced Trp (or hydropho- 
bic) residues that form a hydrophobic core in a 3D struc- 
ture. The repeat forms a three-helix bundle structure, and 
the second and the third helices adopt a helix-turn-helix 
conformation, with the third helix (the recognition helix) 
directly contacting DNA in the major groove (12). In ver- 
tebrates, c-Myb protein was first identified to contain a 
three-repeat R1R2R3 DBD (13). Structural study showed 
that c-Myb R2 and R3 motifs are packed closely with each 
other so that the two recognition helices can coopera- 
tively recognize specific DNA in the major groove. In 
Arabidopsis thaliana, the Myb protein family is large and 
contains more than 190 distinct members, most of which 
contain an R2R3 DBD (126 genes) with at least 40% 
sequence identity (12,14). Recently, Myb DBD from 
plant telomere-binding protein was found to contain a 
C-terminal extension required for binding plant telomeric 
DNA (15,16). Structural studies further revealed that this 
Myb DBD adopts a novel four-helix tetrahedron structure 
and the C-terminal extension forms the fourth helix, 
which, although not interacting with DNA directly, can 
stabilize the overall structure of DBD and is crucial for 
telomeric DNA recognition (17-19). 

The ?vMyb3 protein contains Rl, R2 and R3 motifs and 
shares high sequence homology with M'Mybl, /vMyb2 and 



human c-Myb in the R2 and R3 motifs (Figure 1). 
However, n>Myb3 protein contains a C-terminal exten- 
sion rich in basic amino acid residues (R 168 SKKRK m ), 
which resembles the canonical nuclear localization signals 
(20,21). In the present study, we used electrophoretic mo- 
bility shift assay (EMSA) and isothermal titration calor- 
imetry (ITC) and identified the DBD of n'Myb3 to be the 
fragment spanning residues 53-180, which encompasses 
R2, R3 motifs and a C-terminal extension (designated 
n'Myb3 5 3_i8o). This fragment shows about a 40 times 
increase in MRE-1 DNA binding ability than the shorter 
fragment (?vMyb3 53 _i 6 i), which encompasses a similar 
R2R3 DBD of TvMybl that was shown in our previous 
study to interact with MRE-1 DNA strongly. The crystal 
structure of ?vMyb3 53 _i 8 o in complex with MRE-1 DNA 
was then determined; the C-terminal extension forms a 
P-hairpin conformation followed by a flexible tail, with 
missing electron density for the last 10 residues. The 
roles of C-terminal extension, including the P-hairpin 
and the C-terminal tail, in DNA binding were further 
investigated by NMR and site-directed mutagenesis. Our 
study provides a structural basis for MRE-1 DNA recog- 
nition, identifies a novel P-hairpin conformation in the 
C-terminal extension and suggests a possible regulation 
of MRE-1 element by acetylation or phosphorylation at 
the C-terminal extension of the /vMyb3 DBD. 

MATERIALS AND METHODS 

Sample preparation 

The DNA fragments encoding M'Myb35 3 _i6i and 
?i'Myb3 53 _ 1 8o were cloned into the pET29b (Novagen) 
vector using Ndel and Xhol restriction sites to obtain a 
recombinant protein with a C-terminal His-tag. The 
various mutants of n>Myb3 53 _i 8 o were generated by 
the QuikChange site-directed mutagenesis protocol 
(Stratagene) and confirmed by DNA sequencing. The 
steps for expression and purification of /vMyb3 
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recombinant proteins are similar to those of fvMybl (22). 
All proteins were expressed in Escherichia coli 
BL21(DE3). For labeled ( 15 N, 15 N/ 13 C and 15 N/ 13 C/ 2 H) 
protein samples, the cells were grown in H 2 0- or D 2 0- 
containing M9 minimal medium containing 15 NH 4 C1 
(lg/1) and/or 13 C-glucose (2g/l) at 37°C. The cells were 
lysed by microfluidizer, and the proteins were purified by 
nickel-nitrilotriacetic acid (Ni-NTA) affinity chromatog- 
raphy. Purity and authenticity of the recombinant proteins 
were verified by SDS-PAGE and mass analysis. Finally, 
the target protein was dialyzed and concentrated with 
buffer (20 mM NaH 2 P0 4 , 50 mM NaCl, 0.5 mM NaN 3 
and 10 mM dithiothreitol) at pH 6.0 for study by NMR 
and circular dichroism (CD). The single-stranded DNAs 
of MRE-1, MRE-2f and biotinylated DNAs for surface 
plasmon resonance (SPR) experiments were purchased 
from MDBio Inc. (Taiwan). The double-stranded DNA 
was prepared by mixing equal amounts of two comple- 
mentary deoxynucleotides, heating to 95° C for lOmin 
and cooling slowly to room temperature. 

EMSA 

Oligonucleotides were labeled at the 5'-end with [y- 32 P] 
ATP (Amersham Pharmacia Biotech) by T4 polynucleo- 
tide kinase as described by the supplier (Pr omega). All 
protein samples were incubated with y- 32 P-labeled 16-bp 
MRE-1 or MRE-2f. The mixtures were separated on a 
10% acrylamide gel by electrophoresis. The DNA- 
protein complexes were detected and imaged by use of a 
Typhoon 9410 variable mode imager. 

ITC 

ITC experiments were performed at 25°C with a MicroCal 
iTC200 system (GE Healthcare). The DNA and protein 
samples were dialyzed in phosphate buffer containing 
20 mM NaH 2 P0 4 , 150mM NaCl, 0.5 mM NaN 3 and 
ImM dithiothreitol at pH 6.0. The proteins (30 uM) in 
the calorimetric cell were titrated with 300 uM DNA in 
the injection syringe. Titration was initiated by one 1-ul 
injection followed by 19 injections of 2ul spaced by 180 s 
intervals. Control experiments involved titrating the DNA 
samples into buffer. The raw calorimetry data were col- 
lected and analyzed by use of the software Origin 7.0 
(Microcal Inc., Northampton, MA, USA). The binding 
isotherms were fitted to the one-site binding model, giving 
values of the stoichiometry (ri) of the interaction, the en- 
thalpy of binding ( AH) and the association constant (K^). 

Crystallization and data collection 

Crystallization trials of the /vMyb3 53 „i 80 /16-bp MRE-1 
DNA complex involved the hanging-drop vapor-diffusion 
method. The M>Myb3 53 _i 8 o/DNA complex (1:1 molar 
ratio) was prepared in a buffer containing 20mM 
HEPES, pH 7.0, 50 mM NaCl and ImM dithiothreitol. 
Initial crystals were obtained by use of crystal screen kits 
(Hampton Research) at 4mg/ml. These crystals grew in a 
buffer of 0.1 M sodium cacodylate trihydrate, 0.1 M mag- 
nesium acetate tetrahydrate and 10% w/v PEG8000 at 
pH 6.4. The cryogenic data for the n>Myb3 53 _i 80 /DNA 
crystal were collected by the use of the BL13B1 



beamline of the National Synchrotron Radiation 
Research Center (NSRRC) in Taiwan. Crystal was 
flash-frozen in a liquid-nitrogen stream at 100 K. The 
data were processed and scaled by use of the program 
HKL2000 (23). The crystal diffracted to 2.9 A and 
belongs to the PI space group with unit-cell parameters, 
a = 45.81, b = 71.77, c = 87.82, a = 94.68, p = 97.84 and 
y = 99.28, and R meige 4.6%. The V M was calculated as 
2.71 A 3 Da _1 , corresponding to a solvent content of 
45%, containing four molecules per asymmetric unit. 

Structure determination and refinement 

The crystal structure of the ?vMyb3 53 _i 80 /16-bp MRE-1 
DNA complex was determined by the molecular replace- 
ment method with MOLREP (24). The c-Myb/DNA 
complex (PDB code: 1H8A) with the highest protein 
sequence identity of 50% with A'Myb3 53 _i 8 o/DNA was se- 
lected as the template model. Data between 8.0 and 4.0 A 
and a Patterson radius of 25 A were used for the rotation 
and translation function calculations. Significant rotation 
and translation solutions were obtained for the ?vMyb3 53 _ 
i 80 /DNA. Structural model building involved XtalView 
programs (25) and structural refinement was calculated 
by the use of CNS (26). A bulk solvent mask was cal- 
culated to improve the reflection data. The final model 
for the ?vMyb3 53 _ 180 /DNA contains Lys 53 to Lys 170 in 
one molecule and Lys 53 to Ser 169 in the others. A total 
of 98 water molecules and four DNA molecules were 
located in the electron density map. The model was 
refined to a i?f ac tor of 21.8% for all reflections above 2a 
and a i?f ree of 27.3% using 5.0% randomly distributed 
reflections at 2.9 A. Geometry of the structure checked 
by a Ramachandran plot with use of PROCHECK (27) 
showed 80.4% of protein residues in the most favored 
regions and no residues in disallowed regions. The statis- 
tics of data collection and structure refinement are in 
Table 2. 

SPR 

The real-time sensorgrams of n>Myb3 53 _i 8 o and several 
mutants interacting with a 21 -bp MRE-1 DNA duplex 
were measured at 25°C on a BIAcore 3000 instrument 
(GE Healthcare). The 5'-biotinylated MRE-1 DNA 
duplex dissolved in PBS buffer with a concentration of 
0.02 uM was immobilized to the streptavidin SA sensor 
chip at a flow rate of 10 ul/min. Protein samples were 
injected into the SA sensor chip immobilized with DNA 
at a constant flow rate of 30 ul/min for 3 min for associ- 
ation and then the running buffer containing lOmM 
HEPES, 150mM NaCl, 3.4mM EDTA, pH 7.4, 0.05% 
(v/v) Tween-20 and 1 mM dithiothreitol was applied at the 
same rate for 3 min for dissociation. After each injection, 
the sensor chip was regenerated with 1 M NaCl. For 
analyzing the relative protein binding, the sensorgrams 
were evaluated by use of BIAevaluation v4.1 (BIAcore 
AB, Uppsala, Sweden). All values of the relative binding 
were the averages from three to five independent 
experiments. 
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NMR spectroscopy 

NMR samples (unlabeled or 15 N, 15 N/ 13 C and 15 N/ 13 C/ 2 H 
isotopically labeled proteins) at pH 6.0 in phosphate 
buffer were prepared. Each sample contained ~0.30ml 
of 1-2 mM protein in a Shigemi NMR tube. All NMR 
spectra were acquired with the use of Bruker AVANCE 
600 or 800 MHz spectrometers equipped with a triple 
('H, 13 C and 15 N) resonance cryoprobe, which in- 
cluded a shielded z-gradient at 298 or 310 K. Traditional 
triple-resonance experiments [HNCO, HN(CA)CO, 
CBCA(CO)NH and HNCACB] and transverse 
relaxation-optimized spectroscopy-based triple-resonance 
experiments were used for backbone resonance assign- 
ments of free fvMyb3 53 _i8 0 and JvMyb3 53 _igo in complex 
with DNA, respectively. Side-chain resonance assignment 
followed previously published procedures (28). Nuclear 
Overhauser effect (NOE) data were extracted from 3D 
'H- 15 N NOE spectroscopy (NOESY)-heteronuclear 
single-quantum coherence (HSQC) spectra with 120 ms 
mixing time. The weighted chemical shift perturbations 
(CSPs) for backbone 15 N and 'H N resonances were 
calculated by the following equation: AS = [(A<5 H n) 2 + 
(A<5 N /5) 2 ] 0 ' 5 - 'H- 15 N steady-state heteronuclear NOE 
(HX-NOE) experiments involved a 5 ms separated train 
of 120° l H saturation pulses and a recycle delay of 5 s 
(29). To measure residual dipolar couplings (RDCs) of 
n>Myb3 5 3_i8o in complex with DNA, the filamentous bac- 
teriophage Pfl (Asia Biotech Ltd, Latvia) was selected as 
the orienting medium. 2D 'H-coupled (Fl) IPAP 'H- 15 N 
HSQC spectra (30) were acquired with 256 complex t\ 
(15 N) points and 64 scans per t x increment for both the 
isotropic and anisotropic samples. The quality factor of 
RDC values equals RMS(Z) calc - Z>° bs )/RMS(Z>° bs ), where 
D calc and D° hs are calculated and observed RDC values, 
respectively. Processing of all NMR spectra involved use 
of Bruker Topspin 2.0 (Karlsruhe, Germany) or 
NMRPipe (31), and data were analyzed by use of 
NMR View (32). 

Paramagnetic relaxation enhancement measurements 

To observe the intermolecular paramagnetic relaxation 
enhancement (PRE) effects on fvMyb3s 3 _i8o, an EDTA 
was conjugated with deoxythymidine (dT-EDTA) and 
introduced into DNA as described (33). The DNA frag- 
ments with dT-EDTA were purchased from Yao-Hong 
Biotechnology Inc. (Taiwan). Purity and authenticity of 
individual strands of DNA were verified by chromatog- 
raphy and mass analysis. The double-stranded DNA was 
prepared by mixing equal amounts of two complementary 
deoxynucleotides, heating to 95°C for lOmin and cooling 
slowly to room temperature. The EDTA-conjugated DNA 
duplex was mixed with equal amounts of metal ions (Mn 2+ 
or Ca 2+ ), and ?vMyb3 53 _i 8 o (0.2 mM) was added into the 
mixture at a protein-DNA ratio of 1:1.5. PRE 'H N -r 2 
measurement involved 2D 'H-^N correlated spectra 
with a two time-point delay T of 10 ms as described (34). 
Values of 'H N -r 2 were calculated from peak intensities for 
the diamagnetic (chelated with Ca 2+ ) and paramagnetic 
(chelated with Mn 2+ ) state as described (34). 



RESULTS 

Identification of the DBD of 7vMyb3 

Previously, we mapped the DBD of /vMybl as the frag- 
ment from residue 35 to 141 and determined its solution 
structure as well as its complex model with MRE-1 DNA 
(22). To identify the ?vMyb3 DBD, we generated two 
protein fragments and examined their binding interactions 
with the 16-bp DNA containing the MRE-1 of 5'-ATAAC 
GAT-3' or MRE-2f of 5'-GTATCGTC-3' by EMSA. The 
shorter fragment, /vMyb3s3_i6i, which includes similar 
R2R3 motifs as A>Mybl 35 _i 4 i (Figure 1), is not sufficient 
for binding to both DNAs (Figure 2A). Interestingly, 
n>Myb3 53 _ 180 that contains 19 additional C-terminal 
amino acids and displays similar helical conformation and 
slightly higher thermal stability than the shorter /vMyb3 
(Supplementary Figure SI), exhibits the DNA-binding 
ability. 

To confirm the EMSA results and to quantitate the 
DNA-binding affinity of the two ?vMyb3 fragments, we 
monitored the protein-DNA interactions at 150mM 
NaCl by ITC (Figure 2B). Fitting of ITC thermograms 
showed that /vMyb3 binds with 1:1 stoichiometry to the 
specific MRE-1 or MRE-2f DNA. The formation of the 
7vMyb3/MRE-l complex is enthalpically driven and that 
of the /vMyb3/MRE-2f complex entropically driven. The 
association constant (K^) value of /vMyb35 3 _i8o bound 
to 16-bp MRE-1 was 1.51 ± 0.20 x lO^M" 1 and was 
~10 times weaker for MRE-2f DNA (Table 1). Also, the 
shorter fragment bound to MRE-1 DNA ~40 times 
weaker than the longer ?vMyb3, and the interaction with 
MRE-2f DNA was too weak to be determined by ITC, 
which agrees with the EMSA results. Interestingly, with 
thymine replaced at position 8 by adenine (MRE- 
2f_T8A), which changes the sequence to 5'-GTAAC 
GTC-3' (the mutated base is underlined and the conserved 
bases with MRE-1 are in bold italics), the K a value of 
fvMyb3 53 _i8o bound to MRE-2f_T8A was similar to that 
of the ?vMyb3 53 _i8o/MRE-l complex formation, which 
suggests that /vMyb3 53 _i8o recognizes the sequence 5'-TA 
ACG-3' specifically. The longer fragment, ?vMyb3 53 _i 8 o, 
and the MRE-1 DNA were hence selected for complex 
structure study. 

Crystal structure of rvMyb3 53 _ 18 o bound to MRE-1 DNA 

The overall structures of ?vMyb3 53 _i 8o o bound to a 16-bp 
MRE-1 DNA were determined at 2.9 A (Table 2) by X-ray 
crystallography. The asymmetric unit of the crystal 
contains four protein molecules in complex with four 
DNA duplexes (Figure 3A). The N-terminal parts, from 
residues Lys 53 to Ser 151 , of all four protein structures 
revealed the classical R2R3 Myb DBD, with the third 
helices of R2 and R3 motifs intercalating in the major 
groove of DNA. Interestingly, residues from He 154 to 
Leu 164 form a (3-hairpin conformation, which was not pre- 
viously observed in Myb proteins. Additionally, the 
electron density was not observed for the C-terminal 10 
or 1 1 residues, which suggests that they are flexible in the 
crystals. The superimposition of four protein structures 
for all backbone atoms shows a high degree of structural 
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Figure 2. DNA-binding activity of n'Mybl 35 _| 41 , nMyb3 53 _ l6 i and n'Myb3 53 _| g0 probed by EMSA and ITC. (A) EMSA experiments were performed 
using 16-bp oligonucleotides. Three pico mole Myb proteins were incubated with a Y- 32 P-labeled MRE-1 DNA. Three pico mole n'Mybl 35 _ 141 and 
30pmol A'Myb3 proteins were incubated with a y- 32 P-labeled MRE-2f DNA. All protein-DNA mixtures were separated by 10% PAGE. (B) ITC 
thermograms for sequential 2-ul additions of 16 bp MRE-1 DNA (300 uM) to fvMyb proteins (30 uM). The lines were fitted to the integrated heats 
using a single-site binding model. 



Table 1. Thermodynamic parameters of Myb DBDs binding to DNAs 



Sample 


n 


K a (M- 1 ) 


AG (kcal moP 1 ) 


AH (kcal moP 1 ) 


AS (cal moP 1 KT 1 ) 


MRE-1 (16 bp) 












«vMybl 35 _i4i 


1.01 ±0.08 


1.48 (±0.45) x 10 s 


-11.14 


-24.18 ±0.09 


-43.77±0.55 


n>Myb3 53 _ 161 


0.98 ±0.05 


3.40 (±0.43) x 10 5 


-7.54 


-8.75 ±0.07 


-4.07 ±0.02 


/vMyb3 5 3_ 18 o 


0.99 ±0.05 


1.51 (±0.20) x 10 7 


-9.79 


-11.41 ±0.04 


-5.44 ±0.23 


AAAA 


1.10 ±0.01 


2.86 (±0.46) x 10 6 


-8.80 


-10.85±0.14 


-6.87 


MRE-1 (21 bp) 












n>Myb3 53 _ 18 o 


1.09 ±0.00 


6.02 (±0.96) x 10 7 


-10.62 


-13.00±0.05 


-7.99 


S169D 


1.01 ±0.00 


2.67 (±0.41) x 10 7 


-10.14 


-12.93±0.08 


-9.38 


AAAA 


1.01 ±0.00 


5.12 (±0.41) x 10 6 


-9.16 


-13.21 ±0.07 


-13.60 


MRE-2f_T8A (16 bp) 












nMyb3 53 _ 1S0 


1.05±0.03 


1.17 (±0.59) x 10 7 


-9.60 


-6.69 ±0.08 


9.77± 1.32 


MRE-2f (16 bp) 












«vMybl 35 _i4i 


1.10 ±0.09 


1.17 (±0.14) x 10 s 


-11.01 


-23.24 ±0.24 


-41.03 ±0.96 


nMyb3 53 _ 16 i 


ND b 


ND b 


ND b 


ND b 


ND b 


n'Myb3 53 . 180 


1.03 ±0.13 


1.39 (±0.32) x 10 6 


-8.387 


-3.49 ±0.22 


16.4±0.95 



"AG = -RT'lnK a , R = 1.985 cal-K^'moP 1 , gas constant. 
b ND, not determined due to the small AH. 



similarity, with RMSD of 0.47 ±0.05 A for backbone 
atoms. However, fitting only the backbone atoms of the 
R2 and R3 motifs (Lys 53 to Ser 151 ) showed that the 
RMSD for this region can be lowered to 0.35 ±0.03 A, 
but the deviation of the C-terminal residues (Lys 152 to 
Ser 169 ) was increased to 0.95 ±0.19 A (Supplementary 
Figure S2), which implies that the structure of the 
C-terminal fragment in the complex state is relatively 
more flexible than that of R2 and R3 motifs. 

To select a representative complex structure, we deter- 
mined the one-bond RDC constant, 'D NH , of n'Myb3 53 _i 8 o 
in complex with DNA by NMR spectroscopy in solution. 
With the four crystal structures used as templates, the 
theoretical 'D NH RDC values can be calculated. For the 
residues in secondary structural regions, the experimental 
RDC values fit well to the theoretical values from four 
protein structures with quality factors between 0.37 and 
0.43 (Supplementary Figure S3). However, the experimen- 
tal RDC values of all residues fit best to the values from 



the fourth complex structure with a quality factor of 0.65, 
which suggests that the fourth complex structure in crys- 
talline state is similar to that in solution state. 

In the fourth complex structure, ?vMyb35 3 _i 8 n consists of 
six a-helices, a (3-hairpin and a C-terminal tail (Figure 3B), 
with ocl (Glu 58 to Asn 71 ), a2 (Trp 76 to Ser 81 ), a3 (Pro 88 to 
Leu"), a4 (Pro 110 to Lys 122 ), a5 (Trp 127 to Leu 133 ), a.6 
(Asp 139 to Ser 151 ), (31 (He 154 to Thr 156 ) and p2 (Glu 162 to 
Leu 164 ). The three-helix bundles of R2 and R3 motifs are 
stabilized by hydrophobic and electrostatic interactions. 
Two hydrogen bonds (Val 104 and Gly 136 , Asn 97 and 
Thr 138 ) and 1 van der Waals interaction between the 
side chains of Val 104 and Arg 137 span the R2 and R3 
motifs. Notably, the C-terminal fragment makes several 
contacts with the R3 motif (Figure 3C). Leu 164 on 02 
takes part in the formation of the R3 hydrophobic core. 
Arg 153 and Asp 167 form a salt bridge network with Asp 113 
on a4, Arg 145 on a6 and 02P of THY5. In addition, O' of 
Glu 162 on 02 constitutes a hydrogen bond with H'' 12 of 
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Arg 118 on oc4. The C-terminal P-hairpin conformation 
seems to be stabilized by these interactions. 

Structural basis for MRE-l/MRE-2r recognition by 

*vMyb3 5 3_i 8 o 

In the representative ?vMyb353 _i 8u /DNA complex structure, 
the recognition helices (helices 3 and 6) insert into the major 
groove of the MRE-1 DNA duplex. All interactions 
between protein and DNA are illustrated in Figure 3D. 
The side chains of Lys 89 , Glu 93 in helix 3 and Asn 140 , Lys 143 
and Asn 144 in helix 6 form specific hydrogen bonds with the 
bases of GUA9, CYT8, ADE7, GUA8' and ADE6, respect- 
ively (Supplementary Figure S4). Moreover, several 
residues, including Gin 74 , Gin 90 , Arg 92 , Arg 94 and His 98 
in the R2 motif and Trp 108 , Ser 125 , Trp 127 and Ser 128 in the 
R3 motif contact with the phosphate groups of the DNA 
backbone in the major groove. Additionally, the backbone 
atoms of Gly 54 , Pro 55 and Phe 56 interact with the phos- 
phate groups of ADE7 and CYT8 in the minor groove. 
However, there are no direct contacts between DNA and 
the P-hairpin or the C-terminal tail. The nearest distance is 
4.42 A between N" 1 of Arg 168 and OIP of GUA3. 



In the preceding result of identification of the /vMyb3 
DBD, the DNA-binding affinity of ?vMyb3 53 _ 180 is 40 
times stronger than that of ?vMyb353_i6i- Therefore, the 
observation of no interaction between DNA and the 
C-terminal residues of ?vMyb3 5 3_i 8 o in the X-ray structure 
is a surprise. The roles of the C-terminal fragment, 
including the P-hairpin, the C-terminal tail and the 
missing residues, in DNA binding need further investiga- 
tion as described below. 

NMR investigations of the C-terminal fragment 

We observed no electron density for the C-terminal 10 
residues 'K 171 RKAADVPKK 180, in the crystal structure 
of the ?vMyb3 5 3_i 8 o/DNA complex, which may due to its 
high flexibility. However, the DNA length in the crystal 
may be too short for contacting with the last 10 residues. 
Hence, in addition to the 16-bp DNA, 21-, 25- and 30-bp 
DNAs were titrated into 15 N-labeled rvMyb3 5 3_i 80 to 
characterize their interactions by NMR spectroscopy. 
2D 'H- 1s N HSQC spectra showed backbone amide reson- 
ances of several residues in complex with 21 -bp DNA sig- 
nificantly different from those in complex with 16-bp 
DNA (Figure 4A, left panel), and longer DNA did not 
cause substantial further perturbations in these residues 
(Figure 4A, right panel), which suggests that the 21-bp 
MRE-1 sequence is suitable for fvMyb3 5 3_ 180 /DNA 
complex formation. 

The backbone resonance assignments of ?vMyb3 53 _i8o 
in the free state and in complex with 16- and 21-bp 
DNAs were achieved by several 3D triple-resonance ex- 
periments (Supplementary Figure S5). Backbone reson- 
ances of residues in the R2 and R3 motifs (Lys 53 to 
Ser 151 ) undergo similar CSPs in complex with 16- and 
21-bp DNAs (Figure 4B). Most of the largely perturbed 
residues (AS> A(5 average + SD = ~0.5ppm) in the R2R3 
fragment are located at the N terminus, the loop connect- 
ing helixes 1-3, the loop connecting R2 and R3 motifs, 
and helix 6 (Figure 4C). One additional strong CSP was 
observed for Ser 128 located in helix 5. For the residues 
located at the C-terminal fragment (Lys 152 to Lys 180 ), 
the complex sample with 21-bp DNA shows significantly 
larger CSP values than those with 16-bp DNA, which 
suggests that these residues have stronger interactions 
with the longer DNA than the shorter one. 

We then checked the backbone rigidity of ?vMyb3 5 3_i 80 
in the absence and presence of DNA by ! H- 15 N HX-NOE 
measurements (Figure 4D). HX-NOE values of almost all 
/i'Myb3 53 _i 80 residues were significantly increased in the 
presence of DNA; especially the N- (Lys 53 to Thr 57 ) and 
C-terminal residues (He 154 to Lys 180 ), which are highly 
flexible in the absence of DNA, adopt a more rigid con- 
formation on binding to DNA. The average HX-NOE 
values of the R2 and R3 motifs (Lys 53 to Ser 151 ) in the 
absence and presence of 16- and 21-bp DNAs were 
0.78 ±0.08, 0.84 ±0.07 and 0.85 ±0.08, respectively. 
However, the average values of the C-terminal fragment 
(Lys 152 to Lys 180 ) were increased from 0.40 ±0.26 for the 
free state to 0.52±0.18 for the complex with 16-bp DNA 
and 0.57±0.18 for binding to 21-bp DNA. Although the 
C-terminal fragment of M>Myb3 5 3_i 8 o was still flexible, the 
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Figure 3. Crystal structures and interactions of fvMyb3 53 _ 180 in complex with 16-bp MRE-1 DNA. (A) There are four proteins in complex with four 
DNA duplexes in the asymmetric unit of the crystal. (B) The representative complex structure, in which, (\ , Myb3 5 3_ 180 consists of six a-helices, a 
(3-hairpin and a C-terminal tail. The electron density for the C-terminal 10 residues was not observed, and the residues from Glu 162 to Lys are 
colored in red. (C) The hydrophobic, electrostatic and hydrogen-bonding interactions between the R3 motif and C-terminal p-hairpin. The involved 
residues in the C-terminal P-hairpin are labeled. Two 02P atoms of ADE4 and THY5 are shown. The salt bridges are linked with dashed lines in 
magenta and hydrogen bond in blue. (D) Schematic representation illustrating the contacts between n>Myb3 5 3_ 18 o and MRE-1 DNA. The amino acid 
residues in R2 and R3 motifs are shown with green and blue background colors, respectively. The van der Waals contacts and hydrogen-bonding 
interactions are indicated by orange and black dotted lines, respectively. The specific AACG base pairs are colored in pink. The phosphates are 
indicated in yellow circles. 



amide pico-second to nanosecond time scale motions 
indeed are stabilized by the extension of DNA length. 

The results from NMR CSPs and backbone HX-NOE 
values reveal that the interactions between DNA and the 
C-terminal fragment, including the last 10 residues missing 
in the X-ray structure, were increased when the DNA 
length was extended from 16 to 21 bp. So we tried to crys- 
tallize the /vMyb3 53 _ 180 /21-bp MRE-1 DNA complex 
sample, but the resolution of resulting crystals was too 
low to determine the structure. Instead, we used intermo- 
lecular PRE to investigate the interactions between DNA 
and the C-terminal region of £vMyb353_igo. We purchased 
the 21 -bp DNA duplex containing the EDTA-derivatized 
deoxythymidine (dT-EDTA) at the desired site. Values of 
PRE of amide protons ('H N -r 2 ) for /vMyb3 5 3_i 80 were 
determined from a two time-point measurement in both 
the paramagnetic (chelated with Mn 2+ ) and diamagnetic 
(chelated with Ca 2+ ) states (33,34). The residues exhibiting 
1 H N -r 2 >5s~ 1 are located at the N terminus, the loop 



connecting the R2 and R3 motifs, helix 6 and the 
C-terminal tail (Figure 5A). A structural model of 
/vMyb3 5 3_i 8 o in complex with a 21-bp DNA duplex was 
generated by extending the 16-bp DNA in the crystal 
structure with a standard B-form conformation. We 
mapped the intermolecular PRE 1 H N -F 2 values on the 
structural model (Figure 5B), and found that most of 
the PRE 'H N -r 2 values correlated well with the distances 
measured in the model structure, with 'H N -r 2 between 14 
and 10 s -1 corresponding to the distance from 18 to 25 A 
and 'H N -r 2 between 10 and 5s _1 corresponding to the 
distance from 24 to 29 A (35). The C-terminal residue 
Arg 168 exhibits the largest 'H N -r 2 value of 13.3 s" 1 , 
which suggests that it is close to the DNA although it 
does not form any direct contact with DNA in the 
crystal structure. Also, the PRE values of the C-terminal 
12 residues, like Ser 169 and Lys 170 , which should be in 
regions closest to Mn 2+ if they formed stable contacts 
with the extended DNA, are all >5s _1 but smaller than 
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Figure 4. NMR investigations on tvMyb3 53 _ 18 o/DNA complexes. (A) The overlaid 2D 'H- 15 N HSQC spectra of tvMyb3 53 _ 1S0 in complex with 16-bp 
(red, left panel), 21-bp (blue, left and right panels) and 25-bp (purple, right panel) MRE-1 DNAs. (B) The overlay of the 'H- I5 N HSQC spectra of 
free A'Myb3 53 _ 180 (black) and in complex with 21-bp MRE-1 DNA (blue) at 310K. The residues with large chemical shifts are indicated. (C) 
Weighted CSPs for backbone l5 N and ! H N resonances of tvMyb353_ i #o in complex with 1 6-bp (red ) or 2 1 -bp (blue) DNA as calculated by the 
following equation: AS = [(A5 HN ) 2 + (A5 N /5) 2 ] 0 5 - (D) ! H- 15 N heteronuclear NOEs of tvMyb353-180 in free form (black circles), and in complex with 
16-bp (red circles) or 21-bp (blue circles) MRE-1 DNAs. The secondary structures are highlighted in (C and D). 



that of Arg 168 , which suggests that these residues are close 
to DNA but their mobility reduces the observed PRE 
values. 

NMR studies shows that the C-terminal extension is 
flexible but sensitive to the length of interacted DNA, sug- 
gesting that it is involved in DNA binding. The increased 
intermolecular PRE values of C-terminal missing residues 
further imply that these residues have transient inter- 
actions with DNA. 

Mutations on the C-terminal fragment 

From the CSPs, backbone HX-NOE and intermolecular 
PRE, the C-terminal fragment (Lys 152 to Lys 180 ) was 
found to be highly involved in DNA binding, although 
it does not form stable contacts with DNA in the crystal 
structure. To further validate the roles of the C-terminal 
residues in DNA binding, we generated various mutants 
of fvMyb3 5 3_i8o and monitored their interactions with 
DNA by SPR at 150mM NaCl (Figure 5C and 
Supplementary Figure S6). The mutants R153A and 
L164A, which disrupt the electrostatic and hydrophobic 
interactions between the P-hairpin and R3 motif, show a 
significant decrease in DNA binding ability. Interestingly, 
the mutants that change one positively charged residue 
at the C terminus to alanine bind to DNA as strongly 
as does the wild-type protein. However, the mutants 
AA1 (K170A, Kl 71 A) and AA2 (R172A, K173A) show 



notable decreases, and AAAA (K170A, K171A, R172A, 
K173A) exhibits severely reduced DNA binding activity. 

The association constants (K a ) of the AAAA mutant 
bound to 16- and 21-bp MRE-1 DNAs at 150mM NaCl 
were measured by ITC (Table 1). For 21-bp DNA, the 
affinity between DNA and the AAAA mutant (5.12 ± 
0.41 x 10 6 M _I ) was ~12 times weaker than that of the 
wild-type protein (60.2 ±9.6 x 10 6 M _1 ). For 16-bp DNA, 
the reduction in association constant was ~ 5-fold for the 
mutant protein. In addition, the K a value of the S169D 
mutant bound to 21-bp DNA was about half of that of the 
wild-type protein (Table 1). The interactions between 
DNA and S169D, AA1, AA2 or AAAA mutants were 
all weaker than that between DNA and wild-type 
protein, so acetylation at the Lys residue or phosphoryl- 
ation at the Ser residue in the C-terminal fragment will 
lower the binding affinity between fvMyb3 53 _i 80 and 
DNA. 

Thus, the structural integrity of the R3 motif and the 
P-hairpin and the positively charged residues at the C 
terminus are critical for DNA binding. 

The p-hairpin conformation is preserved in free 
fvMyb3 53 _ 18 o 

According to 3D structure comparison by the Dali server, 
the C-terminal p-hairpin conformation of /vMyb3 53 _i 8 o is 
novel as compared with other Myb-like DBD. Hence, 
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Figure 5. Intermolecular PRE for (\'Myb3 53 _ 180 /DNA complex and DNA binding activities of (\ , Myb3 5 3_ 180 mutants. (A) EDTA labeled site on 
21-bp MRE-1 DNA (in red) and intermolecular PRE 'H N -r 2 values obtained with EDTA chelating with Mn 2+ . (B) The intermolecular PRE values 
are mapped on a structural model of n>Myb3 53 _ 180 in complex with a 21-bp DNA duplex, which is generated by extending the 16-bp DNA in the 
crystal structure with standard B-form conformation. EDTA and Mn 2+ are shown in magenta. The residues exhibiting 'H N -r 2 values >10s are in 
red, 5-10 s~ in orange and <5s~' in blue. The missing C-terminal 10 residues are indicated with one-letter abbreviations. (C) DNA binding activities 
of n'Myb3 53 _ 180 mutants probed with SPR. The relative DNA binding activities of mutants are calculated from SPR responses during 15 s before the 
end of protein injection. 



determining whether this unique structure is formed only 
in the DNA-bound form or is preserved in the free form is 
of interest. From backbone HX-NOE experiments of free 
/vMyb3 5 3_i8o (Figure 4D), the residues in the P-strands, 
especially Leu 164 and Leu 165 , exhibit backbone HX-NOE 
values larger than those of N- and C-terminal residues, 
which suggests that this region adopts a conformation 
more stable than a random coil. As well, 15 N-edited 
NOESY-HSQC spectra revealed several long-range 
NOEs between these C-terminal residues (Arg 153 to 
Leu 165 ), like the NOEs between H a of Thr 156 and H N of 
Lys 161 -Ile 163 (Figure 6A). The backbone NOE connectiv- 
ity among these residues suggests that this fragment 
adopts a hairpin conformation in free n'Myb3 53 _i 8 o 
(Figure 6B). 

DISCUSSION 

The C-terminal p-hairpin extension is unique and crucial 
for MRE-1 DNA recognition 

Myb protein, a member of the transcription factor family, 
represents a functional DBD that is conserved in a wide 
range of eukaryotic systems. c-Myb protein was first 
identified in vertebrates and found to contain a three- 
repeat R1R2R3 DBD. Each repeat of c-Myb forms a 
three-helix bundle conformation, but only the third 
helices of R2 and R3 motifs recognize specific DNA in 
the major groove (13). In plants, most of the double- 
stranded telomere-binding proteins contain only one 
Myb-like domain. However, this Myb-like domain alone 



is not sufficient for telomere binding. A C-terminal exten- 
sion was found to be highly involved in DNA binding 
(15,16). Structural studies revealed that instead of adopting 
the typical fold of Myb DBD, the Myb-like domain of 
plant double-stranded telomere-binding protein folds 
into a four-helix tetrahedron structure, and the fourth 
helix, which is formed by the C-terminal extension, par- 
ticipates in the formation of a hydrophobic core and 
stabilizes the whole DBD structure for telomere binding 
(17-19). We determined the structure of ?vMyb3 53 _i 8 o in 
complex with MRE-1 DNA and showed that the 
C-terminal extension forms a p-hairpin conformation, 
which was demonstrated by NMR to participate in 
DNA binding and by site-directed mutagenesis to be cru- 
cial for DNA binding. Also, NMR studies indicate that 
the C-terminal p-hairpin conformation is formed in the 
DNA-bound state as well as in free protein. The unique 
features of this C-terminal P-hairpin conformation are 
described below. 

First, like the fourth helix in the Myb-like domain of 
plant double-stranded telomere-binding protein, the 
C-terminal P-hairpin conformation of /vMyb3 53 _i8 0 , 
although not making any direct contact with DNA, 
plays a role in stabilizing the overall structure and is 
critical for DNA binding. This observation was confirmed 
by determining the thermal stability, whereby A>Myb3 53 _i 80 
is 3°C more stable than the shorter fragment n>Myb3 53 _i 6 i, 
and by the analysis of the mutants R153A and L164A, 
which disrupt the electrostatic and hydrophobic interactions 
between the P-hairpin and R3 motif, showing a significant 
decrease in DNA binding ability. The one exception is the 
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Figure 6. The C-terminal fragment of /i'Myb3 53 _ 18 o forms a P-hairpin conformation without DNA. (A) Strip plots from 15 N-edited NOESY-HSQC 
spectrum of free fvMyb3 5 3_i g0 , in which NOE cross-peaks between of Thr 156 and H N of Asn 157 -Ile 63 are connected by a black line. (B) NOE 
connectivity of residues from Arg 153 to Leu 165 . Strong, medium and weak NOEs are indicated by solid-thick, solid and solid-thin arrows, respect- 
ively. The dashed arrows indicate the ambiguous NOEs, which cannot be clearly identified because of overlapping. 



mutant D167A, which, although breaking the salt bridge 
networks among Asp 167 , Arg 1 " and Arg 145 (Figure 3C), 
showed an increase in DNA binding activity. The mutant 
D167A may reduce the repulsion forces between O 51 of 
Asp 167 and 02P of ADE4, which is 4.4- A apart in the 
crystal structure, and the salt bridge networks can be sup- 
plemented by the interactions between N^ 2 of Arg 145 and 
02P of THY5, which is 4.1 -A apart in the crystal structure. 

Second, the C-terminal tail, which is mostly not 
observed in the complex structure, is also important for 
DNA binding, which suggests that the DNA recognition 
by ?vMyb3 may be regulated by post-translational modi- 
fication at these residues. Our NMR studies demonstrated 
that the C-terminal tail, although flexible, has some inter- 
actions with the extended DNA. As well, the AA1, AA2 
and AAAA mutants, which replaced the positively 
charged residues in the C-terminal tail to alanine and 
S169D, which introduces a negative charge to mimic phos- 
phorylation on Ser residue, all show a significant decrease 
in DNA binding affinity. In the complex structure, the 
third helix of the R3 motif contacts with DNA, and the 
C-terminal basic residues are restricted by the P-hairpin to 
be in a position next to the third helix of the R3 motif, 
where these basic side chains may have transient inter- 
actions with the DNA phosphate backbone or the minor 
groove. According to this view, the interaction mode 
shows similarity with the C-terminal fragment of 
another DBD, the winged helix or the forkhead box 
(termed FOX). The FOX motif consists of two wings 
(Wl and W2), three a-helices (al, oc2 and a3) and three 
B-strands (Bl, P2 and P3), arranged in order of al-pl-oc2- 
a3-p2-Wl-p3-W2. Like the C-terminal half of 
?vMyb3 53 _i g o, the second and third helices of FOX 
adopt a helix-turn-helix conformation, P2 and p3 forms 
an anti-parallel p-sheet and the loop or wing W2 extends 
from P3 to the C terminus of the DBD. In DNA recogni- 
tion by canonical FOX proteins, the third helix makes 



direct contact with DNA in the major groove and the 
wing Wl interacts with DNA in the minor groove (36). 
In the FOX03a/DNA complex structure, the C-terminal 
W2 basic residues forms hydrogen bonds with the DNA 
phosphate backbone (37). For FOXOl protein, the 
flexible C-terminal fragment is not observed in the 
complex structure (38). However, the C-terminal coil is 
necessary for DNA binding in both FOX03a and 
FOXOl proteins, and their DNA binding activities are 
attenuated by phosphorylation of Ser or acetylation of 
Lys residues in the C-terminal coil (37,38). According to 
our study of /vMyb3 and its similarity with FOX protein, 
MRE-1 element recognition by fvMyb3 protein may be 
regulated by phosphorylation or acetylation of the 
C-terminal residues. 

Structural comparison of the R2R3 DBD with other Myb 
proteins 

The /vMyb3 protein exhibits high amino acid sequence 
identity with human c-Myb and ?vMybl in the R2R3 
DBD (Figure 1). The structure of c-Myb DBD in 
complex with DNA has been reported (13,39). In our 
previous study, we determined the solution structure of 
free ?vMybl 35 _ 141 and proposed the /vMybl 35 _ 141 / 
MRE-1 DNA complex model using CSPs, residual 
dipolar couplings, DNA specificity data and data-driven 
macromolecular docking by HADDOCK (22). We found 
that the orientation between R2 and R3 motifs of /vMybl 
greatly changes on binding to DNA to recognize the DNA 
major groove through a number of contacts involving 
residues in two recognition helices. A superimposition of 
the six helices from ZvMyb3 R2R3 DBD with those from 
Zi'Mybl and c-Myb showed high similarity in backbone 
structure, with RMSD of 0.91 and 1.79 A for C a atoms, 
respectively (Supplementary Figure S7), so the three R2R3 
DBDs may adopt a similar architecture of protein 
backbone to recognize the DNA major groove. 
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For the residues in two recognition helices, /vMyb3 
protein shows 83% amino acid sequence identity with 
human c-Myb. The R2R3 DBD of c-Myb recognizes the 
base sequence 5'-TAACNG-3' specifically and that of 
/vMyb3 binds to the MRE-1 element containing 5'-TAA 
CGA-3' with high specificity. High similarities in both 
protein and DNA sequences suggest that the protein- 
DNA interactions of /vMyb3 and c-Myb are similar. 
Indeed, close examination of the two protein-DNA 
complex structures reveals that most of the residues 
identified for DNA recognition in nMyb3 can also be 
found in the c-Myb/DNA complex (Figure 3D and 
Supplementary Figure S8A). One difference is that Lys 89 
in helix 3 forms specific hydrogen bonds with the bases of 
GUA9 in the fvMyb3 53 _igo/DNA complex structure, but 
the corresponding Lys residue (Lys 128 ) in c-Myb does not 
contact with the corresponding GUA (GUA21) but rather 
with GUA22. Also, the residues Trp 108 , Gly 54 and Gin 74 
in ?vMyb3 contact with the DNA phosphate backbone, 
and these interactions are not conserved in the c-Myb/ 
DNA complex. 

The amino acid sequence identity for the residues in two 
recognition helices between ?vMyb3 and ?vMybl is only 
57%. ^vMybl recognizes the whole MRE-1 element con- 
taining 5'-TAACGAT-3', whereas /vMyb3 binds only to 
5'-AACG-3'. The examination of two protein-DNA inter- 
actions reveals that three specific hydrogen bonds with 
bases and four hydrogen bonds with a phosphate 
backbone in the fvMyb3 53 _i 8 o/DNA complex structure 
can be found in the ?vMybl 35 _ 141 /DNA complex model 
(Figure 3D and Supplementary Figure S8B). However, 
two Lys residues in ?vMyb3, Lys 89 and Lys 143 , which rec- 
ognize bases specifically, are Arg residues in fvMybl 
(Arg 71 and Arg 25 ) and form hydrogen bonds with bases 
of ADE10 and ADE11' and the phosphate backbone of 
CYT9', respectively, in the complex model. In addition, 
the specific interactions of Asn 6 and Gin 72 identified in 
the /vMybl 35 _ 141 /DNA complex are not found in the 
/vMyb3 complex structure. The corresponding residue of 
Asn 69 in ?vMyb3 is Ser 87 , which does not recognize the 
base of ADE10, perhaps because of its shorter side 
chain. However, without the crystal structure of the 
/vMybl 35 _i4i/DNA complex, confirming the interactions 
observed in the model is not easy. The comparison of 
/vMybl and ?vMyb3 complexes suggests that the 
strategy that combined the free form structure with 
CSPs, residual dipolar couplings, DNA specificity data 
and data-driven macromolecular docking can generate a 
model with reliable protein backbone architecture and 
some conserved protein-DNA interactions. This method 
is an alternative way to obtain the structural basis of 
protein-DNA interactions if the crystal structure is not 
available. 



ACCESSION NUMBER 

The atomic coordinates of ?vMyb3 53 _ 180 /16-bp MRE-1 
DNA complex were deposited in the Protein Data Bank 
under accession code 3zqc. 



SUPPLEMENTARY DATA 

Supplementary Data are available at NAR Online. 



ACKNOWLEDGEMENTS 

We are grateful for the access to the synchrotron radiation 
beamline 13B1 at the National Synchrotron Radiation 
Research Center (NSRRC) in Taiwan. The NMR 
spectra were obtained at the High-Field Nuclear 
Magnetic Resonance Center (HFNMRC) supported by 
the National Research Program for Genomic Medicine. 
We thank Dr Shu-Chuan Jao of the Biophysics Core 
Facility, Scientific Instrument Center at Academia Sinica 
for providing technical assistance of ITC experiments. We 
also thank Dr Chi-Fon Chang for setting up the experi- 
ments for PRE measurements and Laura Smales for 
copyediting the manuscript. 



FUNDING 

Academia Sinica and the National Science Council, 
Taiwan, ROC (NSC 95-2320-B-001-040-MY2 to C.C. 
and NSC98-2311-B-001-009-MY3 to C.-D.H.). Funding 
for open access charge: Academia Sinica. 

Conflict of interest statement. None declared. 



REFERENCES 

1. Sorvillo,F., Smith,L., Kerndt.P. and Ash,L. (2001) Trichomonas 
vaginalis, HIV, and African-Americans. Emerg. Infect. Dis., 1, 
927-932. 

2. Soper,D. (2004) Trichomoniasis: under control or 
undercontrolled? Am. J. Obstet. Gynecol.. 190, 281-290. 

3. Cudmore,S.L., Delgaty,K.L., Hayward-McClelland.S.F., 
Petrin.D.P. and Garber.G.E. (2004) Treatment of infections 
caused by metronidazole-resistant Trichomonas vaginalis. 
Clin. Microbiol. Rev.. 17, 783-793. 

4. Dunne, R.L., Dunn,L.A., Upcroft.P., O'Donoghue.P.J. and 
UpcroftJ.A. (2003) Drug resistance in the sexually transmitted 
protozoan Trichomonas vaginalis. Cell Res., 13, 239-249. 

5. Ong,SJ., Huang.S.C, Liu.H.W. and TaiJ.H. (2004) Involvement 
of multiple DNA elements in iron-inducible transcription of the 
ap65-l gene in the protozoan parasite Trichomonas vaginalis. 
Mol. Microbiol, 52, 1721-1730. 

6. Tsai,C.D., Liu,H.W. and TaiJ.H. (2002) Characterization of an 
iron-responsive promoter in the protozoan pathogen Trichomonas 
vaginalis. J. Biol. Chem., 277, 5153-5162. 

7. Alderete,J.F., O'BrienJ.L., Arroyo, R., EngbringJ.A., 
Musatovova.O., Lopez.O., Lauriano,C. and Nguyen,J. (1995) 
Cloning and molecular characterization of two genes encoding 
adhesion proteins involved in Trichomonas vaginalis 
cytoadherence. Mol. Microbiol., 17, 69-83. 

8. Garcia.A.F., Chang,T.H., Benchimol.M., Klumpp.D.J., 
Lehker,M.W. and AldereteJ.F. (2003) Iron and contact with host 
cells induce expression of adhesins on surface of Trichomonas 
vaginalis. Mol. Microbiol, 47, 1207-1224. 

9. Hsu,H.M., Ong,S.J., Lee.M.C. and TaiJ.H. (2009) 
Transcriptional regulation of an iron-inducible gene by differential 
and alternate promoter entries of multiple Myb proteins in the 
protozoan parasite Trichomonas vaginalis. Eukaryot. Cell, 8, 
362-372. 

10. Ong,S.J., Hsu,H.M., Liu.H.W., Chu,C.H. and TaiJ.H. (2006) 
Multifarious transcriptional regulation of adhesion protein gene 
ap65-l by a novel Mybl protein in the protozoan parasite 
Trichomonas vaginalis. Eukaryot. Cell, 5, 391—399. 



460 Nucleic Acids Research, 2012, Vol. 40, No. 1 



11. Ong,S.J., Hsu,H.M., Liu.H.W., Chu,C.H. and Tai.J.H. (2007) 
Activation of multifarious transcription of an adhesion protein 
ap65-l gene by a novel Myb2 protein in the protozoan parasite 
Trichomonas vaginalis. J. Biol. Chem., 282, 6716-6725. 

12. Dubos,C, Stracke,R., Grotewold.E., Weisshaar,B., Martin, C. and 
Lepiniec,L. (2010) MYB transcription factors in Arabidopsis. 
Trends Plant Sci., 15, 573-581. 

13. Ogata,K., Morikawa,S., Nakamura,H., Sekikawa,A., Inoue,T., 
Kanai,H., Sarai.A., Ishii,S. and Nishimura.Y. (1994) Solution 
structure of a specific DNA complex of the Myb DNA-binding 
domain with cooperative recognition helices. Cell, 79, 639-648. 

14. Stracke.R., Werber,M. and Weisshaar.B. (2001) The R2R3-MYB 
gene family in Arabidopsis thaliana. Citrr. Opin. Plant Biol, 4, 
447-456. 

15. Chen.C.M., Wang,C.T. and Ho.C.H. (2001) A plant gene 
encoding a Myb-like protein that binds telomeric GGTTTAG 
repeats in vitro. /. Biol. Chem., 276, 16511-16519. 

16. Karamysheva.Z.N., Surovtseva,Y.V., Vespa,L., Shakirov,E.V. and 
Shippen,D.E. (2004) A C-terminal Myb extension domain defines 
a novel family of double-strand telomeric DNA-binding proteins 
in Arabidopsis. J. Biol. Chem., 279, 47799^17807. 

17. Ko,S., Yu,E.Y., Shin,J., Yoo,H.H., Tanaka,T., Kim,W.T., 
Cho,H.S., Lee,W. and ChungJ.K. (2009) Solution structure of the 
DNA binding domain of rice telomere binding protein RTBP1. 
Biochemistry, 48, 827-838. 

18. Ko.S., Jun.S.H., Bae,H., ByunJ.S., Han,W., Park,H., Yang,S.W., 
Park,S.Y., Jeon,Y.H., Cheong,C. et al. (2008) Structure of the 
DNA-binding domain of NgTRFl reveals unique features of 
plant telomere-binding proteins. Nucleic Acids Res., 36, 
2739-2755. 

19. Sue.S.C, Hsiao,H.H., Chung.B.C.P., Cheng.Y.H., Hsueh,K.L., 
Chen,C.M., Ho,C.H. and Huang.T.H. (2006) Solution structure 
of the Arabidopsis thaliana telomeric repeat-binding protein DNA 
binding domain: a new fold with an additional C-terminal helix. 
J. Mol. Biol, 356, 72-85. 

20. Boulikas,T. (1993) Nuclear localization signals (NLS). 
Crit. Rev. Eukaryot. Gene Expr., 3, 193—227. 

21. Lange,A., Mills,R.E., Lange,C.J., Stewart, M., Devine,S.E. and 
Corbett,A.H. (2007) Classical nuclear localization signals: 
definition, function, and interaction with importin alpha. 

J. Biol. Chem., 282, 5101-5105. 

22. Lou,Y.C, Wei.S.Y., Rajasekaran,M., Chou,C.C, Hsu,H.M.. 
TaiJ.H. and Chen,C. (2009) NMR structural analysis of DNA 
recognition by a novel Mybl DNA-binding domain in the 
protozoan parasite Trichomonas vaginalis. Nucleic Acids Res., 37, 
2381-2394. 

23. Otwinowski.Z. and Minor,W. (1997) Processing of X-ray 
diffraction data collected in oscillation mode. Methods Enzymol., 
216, 307-326. 

24. Vagin,A. and Teplyakov,A. (1997) MOLREP: an Automated 
Program for Molecular Replacement. /. Appl. Crvstallogr., 30, 
1022-1025. 

25. McRee.D.E. (1999) XtalView/Xfit-A versatile program for 
manipulating atomic coordinates and electron density. 

J. Struct. Biol., 125, 156-165. 



26. Brunger,A.T., Adams.P.D., Clore,G.M., DeLano,W.L., Gros,P., 
Grosse-Kunstleve,R.W., JiangJ.S., KuszewskiJ., Nilges,M., 
Pannu,N.S. et al. (1998) Crystallography & NMR system: a new 
software suite for macromolecular structure determination. 
Acta Crvstallogr. D Biol. Crvstallogr., 54, 905-921. 

27. Laskowski,R.A., MacArthur,M.W., Moss,D.S. and Thornton.J.M. 
(1993) PROCHECK: a program to check the stereochemical 
quality of protein structures. J. Appl. Crystallogr., 26, 283-291. 

28. Lin,T.H., Chen,C, Huang,R.F., Lee,Y.L., ShawJ.F. and 
Huang,T.H. (1998) Multinuclear NMR resonance assignments 
and the secondary structure of Escherichia coli thioesterase/ 
protease I: a member of a new subclass of lipolytic enzymes. 
/. Biomol. NMR, 11, 363-380. 

29. Barbato,G, Ikura,M., Kay,L.E., Pastor,R.W. and Bax,A. (1992) 
Backbone dynamics of calmodulin studied by 15N relaxation 
using inverse detected two-dimensional NMR spectroscopy: the 
central helix is flexible. Biochemistry, 31, 5269-5278. 

30. Ottiger,M., Delaglio,F. and Bax.A. (1998) Measurement of J and 
dipolar couplings from simplified two-dimensional NMR spectra. 
/. Magn. Reson., 131, 373-378. 

31. Delaglio,F., Grzesiek,S., Vuister.G.W., Zhu,G., PfeiferJ. and 
Bax,A. (1995) NMRPipe: a multidimensional spectral processing 
system based on UNIX pipes. J. Biomol. NMR, 6, 277-293. 

32. Johnson,B.A. and Blevins,R.A. (1994) NMR View: A computer 
program for the visualization and analysis of NMR data. 

/. Biomol. NMR, 4, 603-614. 

33. IwaharaJ., Anderson,D.E., Murphy,E.C. and Clore,G.M. (2003) 
EDTA-derivatized deoxythymidine as a tool for rapid 
determination of protein binding polarity to DNA by 
intermolecular paramagnetic relaxation enhancement. 

/. Am. Chem. Soc, 125, 6634-6635. 

34. IwaharaJ., Tang,C. and Clore,G.M. (2007) Practical aspects of 
H-l transverse paramagnetic relaxation enhancement 
measurements on macromolecules. J. Magn. Reson., 184, 185-195. 

35. Donaldson,L.W., Skrynnikov,N.R., Choy,W.Y., 
Muhandiram,D.R., Sarkar,B., Forman-KayJ.D. and Kay,L.E. 
(2001) Structural characterization of proteins with an attached 
ATCUN motif by paramagnetic relaxation enhancement NMR 
spectroscopy. /. Am. Chem. Soc, 123, 9843-9847. 

36. Gajiwala,K.S. and Burley,S.K. (2000) Winged helix proteins. 
Curr. Opin. Struct. Biol., 10, 110-116. 

37. Tsai,K.L., Sun,Y.J., Huang,C.Y., YangJ.Y., Hung.M.C. and 
Hsiao,C.D. (2007) Crystal structure of the human 
FOX03a-DBD/DNA complex suggests the effects 

of post-translational modification. Nucleic Acids Res., 35, 
6984-6994. 

38. Brent.M.M., Anand,R. and Marmorstein,R. (2008) Structural 
basis for DNA recognition by FoxOl and its regulation by 
posttranslational modification. Structure, 16, 1407-1416. 

39. Tahirov,T.H., Sato,K., Ichikawa-Iwata,E., Sasaki,M., Inoue- 
Bungo,T., Shiina,M., Kimura,K., Takata,S., Fujikawa.A., 
Morii,H. et al. (2002) Mechanism of c-Myb-C/EBP 

beta cooperation from separated sites on a promoter. Cell, 108, 
57-70. 



